PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID CA03g34750
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; lamiids; Solanales; Solanaceae; Solanoideae; Capsiceae; Capsicum
Family HD-ZIP
Protein Properties Length: 775aa    MW: 86788.4 Da    PI: 7.0986
Description HD-ZIP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
CA03g34750genomePEPView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Homeobox65.95.4e-21110165156
                 TT--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
    Homeobox   1 rrkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                 r+k +++t +q++e+e+lF+++++p++++r++L+k+lgL  rqVk+WFqNrR++ k
  CA03g34750 110 RKKYHRHTVQQIREMEALFKESPHPDEKQRQQLSKQLGLHPRQVKFWFQNRRTQIK 165
                 7999************************************************9877 PP

2START208.72.2e-652845093206
                 HHHHHHHHHHHHHHC-TT-EEEE....EXCCTTEEEEEEESSS.........SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-S....EEEEEEEE CS
       START   3 aeeaaqelvkkalaeepgWvkss....esengdevlqkfeeskv........dsgealrasgvvdmvlallveellddkeqWdetla....kaetlevi 85 
                  ++a+++l k+a+ +ep+W +s     e++n+de++++f+  +           +ea+r++g+v+m+l++l+++++d++ qW+e+++    ka+t++vi
  CA03g34750 284 VNQAMEQLKKMATCGEPLWIRSFetgrEILNYDEYMKEFPLMEKsgdvkskrMCIEASRETGIVFMELPRLLQTFMDVN-QWKEMFPsmisKAATVDVI 381
                 57899*****************99***************876657889999999*************************.******************* PP

                 CTT.......EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--.-TTSEE-EESSEEEEEEEECTCEEEEEEEE-EE--SSX CS
       START  86 ssg.......galqlmvaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppesssvvRaellpSgiliepksnghskvtwvehvdlkgrl 176
                 ++g       ga+qlm+ae q+l+p+v  R+++fvRy++q ++g+w ivdvSvd  +++  ++s+v++++lpSg+++++ sn ++kvtwveh ++++ +
  CA03g34750 382 CNGeganswdGAVQLMFAEVQMLTPVVGtREVYFVRYCKQIRGGQWGIVDVSVDKVEHNI-DASLVKCRKLPSGCILQEQSNARCKVTWVEHLECQKGI 479
                 ***********************************************************8.9************************************* PP

                 XHHHHHHHHHHHHHHHHHHHHHHTXXXXXX CS
       START 177 phwllrslvksglaegaktwvatlqrqcek 206
                 +++l+r++v+sg+a+ga++w+atlq+qce+
  CA03g34750 480 VDSLYRVIVNSGQAFGARRWMATLQQQCER 509
                 ****************************97 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
Gene3DG3DSA:1.10.10.609.8E-2294161IPR009057Homeodomain-like
SuperFamilySSF466891.75E-2097168IPR009057Homeodomain-like
PROSITE profilePS5007117.799107167IPR001356Homeobox domain
SMARTSM003891.9E-18109171IPR001356Homeobox domain
PfamPF000462.8E-18110165IPR001356Homeobox domain
CDDcd000861.90E-16114165No hitNo description
PROSITE patternPS000270142165IPR017970Homeobox, conserved site
PROSITE profilePS5084835.948273512IPR002913START domain
SuperFamilySSF559619.45E-33275509No hitNo description
CDDcd088754.57E-101277508No hitNo description
SMARTSM002344.4E-62282509IPR002913START domain
PfamPF018529.6E-53284509IPR002913START domain
Gene3DG3DSA:3.30.530.203.7E-7337508IPR023393START-like domain
SuperFamilySSF559616.81E-13531735No hitNo description
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0009957Biological Processepidermal cell fate specification
GO:0010062Biological Processnegative regulation of trichoblast fate specification
GO:0005634Cellular Componentnucleus
GO:0008289Molecular Functionlipid binding
GO:0043565Molecular Functionsequence-specific DNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 775 aa     Download sequence    Send to blast
MVVVDMSNNP PPPSHETKDF FPSPALSLSL AGIFRDGGGE GNSAGNMETM EEVGDESKGG  60
RPREETSTVE ISSENSEPMR SRGSDDDLEH DDTCNEDKED PNNNSRKKKR KKYHRHTVQQ  120
IREMEALFKE SPHPDEKQRQ QLSKQLGLHP RQVKFWFQNR RTQIKAIQER HENSLLKAEI  180
EKLREENKGL REISKNPTCP NCGFASSSNN DPRVPAEEQQ LRIENARLRA EVEKLRAALG  240
KYPLGASPNS SSSYSGGHDE ENKSALDFYT GIFGLEKSRI MHVVNQAMEQ LKKMATCGEP  300
LWIRSFETGR EILNYDEYMK EFPLMEKSGD VKSKRMCIEA SRETGIVFME LPRLLQTFMD  360
VNQWKEMFPS MISKAATVDV ICNGEGANSW DGAVQLMFAE VQMLTPVVGT REVYFVRYCK  420
QIRGGQWGIV DVSVDKVEHN IDASLVKCRK LPSGCILQEQ SNARCKVTWV EHLECQKGIV  480
DSLYRVIVNS GQAFGARRWM ATLQQQCERL LFFMATNIPT KDTPGVATLA GRKSILTLAQ  540
RMTWSFYRML GASSYNTWNK VPSKTGQEDI RVASRKNLTD PGEPLGLILC AVSSIWLPVS  600
RNVLFDFLKD ENRRHEWDVM SNGGPVQSVA NLAKGQDKGN AVSIQVSISR RRRENMWILQ  660
DTCTNAYESA VVYAPVDIAG MQSVITGCDS SNIAMLPSGF SILPDGLESR PFVITSKPED  720
RSSEGGSLLT VAFQILTSSS PTAKLSKESI ESINNLLSCT LHKIKASFQC DNGY*
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1105111RKKKRKK
2107111KKRKK
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankHG9755151e-141HG975515.1 Solanum lycopersicum chromosome ch03, complete genome.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_016562690.10.0PREDICTED: homeobox-leucine zipper protein GLABRA 2
SwissprotP466070.0HGL2_ARATH; Homeobox-leucine zipper protein GLABRA 2
TrEMBLA0A0V0IS600.0A0A0V0IS60_SOLCH; Putative homeobox-leucine zipper protein GLABRA 2-like
STRINGSolyc03g120620.2.10.0(Solanum lycopersicum)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
AsteridsOGEA62282328
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G79840.10.0HD-ZIP family protein